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Abstract 

We study multistep Bayesian betting strategies in coin-tossing games in the 
framework of game-theoretic probabihty of Shafer and Vovk (2001). We show that 
by a countable mixture of these strategies, a gambler or an investor can exploit 
arbitrary patterns of deviations of nature's moves from independent Bernoulli trials. 
We then apply our scheme to asset trading games in continuous time and derive the 
exponential growth rate of the investor's capital when the variation exponent of the 
asset price path deviates from two. 
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1 Introduction 

The field of game-theoretic probability and finance established by Shafer and Vovk [12] has 
been rapidly developing in many directions. The present authors have been contributing 
to this exciting new field by focusing mainly on explicit strategies of the gambler and the 
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growth rate of his capital (IH], [15], [6], [7], [8], [13], [5], [IS]). Following the terminology 
of Shafer and Vovk [12j, we refer to the gambler as Skeptic or Investor (Section H]) and 
refer to nature as Reality or Market (Section Hj). 

In this paper we extend the results of [7] and [TB] by considering multistep Bayesian 
strategies. In [7] we considered a class of Bayesian strategies for Skeptic in coin-tossing 
games, strategies that were based only on the past average of Reality's moves. We proved 
the important fact that if Skeptic uses a Bayesian strategy and Reality violates the strong 
law of large numbers (SLLN), then the exponential growth rate of the Skeptic's capital 
process is very accurately described in terms of the Kullback divergence between the 
average of Reality's moves when she violates SLLN and the average when she observes 
SLLN. Furthermore in [16] we applied Bayesian strategies for coin-tossing games to asset 
trading games in continuous time. If we discretize a continuous-time game by an equi- 
spaced grid in the state space (i.e. the vertical axis), then the continuous time game can 
be approximated by an embedded coin-tossing game. Thus the results of discrete time 
coin-tossing games can be applied to continuous-time games. In particular we gave a 
proof of "-y/di-effect", i.e.. Investor can force Market to choose a price path with variation 
exponent equal to two, within an arbitrary small constant. 

More generally, discretization of continuous-time game in [16] is based on the re- 
quirement that Investor choose a countable number of discrete stopping times against 
a continuous path chosen by Market. This approach allows us to formulate and study 
continuous-time games in game-theoretic probability within the conventional theory of 
analysis, whereas in the book by Shafer and Vovk continuous-time games were formu- 
lated as limits of discrete time games using nonstandard analysis. Vovk has taken up this 
formulation and is currently rapidly developing it in [IT] , [18] , [19] . By these works of Vovk 
it has now become clear that many measure-theoretic results on continuous-time stochas- 
tic processes can be more directly derived in the framework of game-theoretic probability. 
It should be emphasized that the game-theoretic approach is advantageous because no 
probabilistic assumptions on the paths are imposed a priori. Instead, a stochastic behav- 
ior of Market results from the protocol of the game. This has a far-reaching conceptual 
implications for the emergence of probability. 

In the coin-tossing games Reality may deviate from independent Bernoulli trials in a 
subtle way without violating the strong law of large numbers. For example. Reality could 
choose a deterministic sequence where heads and tails alternate. Then SLLN holds for this 
kind of paths and they can not be prevented by Bayesian strategies in [7]. We could use 
"contrarian" strategies ([5]) based on the past average of Reality's moves for preventing 
this kind of paths. However a more natural approach is to model autocorrelations between 
successive moves of Reality. Note that the deterministic sequence of alternating heads 
and tails can be regarded as a sequence with the first-order autocorrelation of —1. These 
considerations naturally lead to multistep Bayesian strategies of the present paper. Then 
by a countable mixture of these strategies Skeptic can detect and exploit arbitrary patterns 
of deviations from the sequence of independent Bernoulli trials. 

Our results have close relations to the ones in many fields. Universal source coding 
has been extensively studied in information theory (e.g. Han and Kobayashi [4j) and the 
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equivalence of source coding and betting is discussed in Cover and Thomas [T]. Various 
notions of randomness have been studied from the viewpoint of Kolmogorov complexity 
(e.g. Li and Vitanyi [10], Lambalgen [9]) and there exists an extensive literature on algo- 
rithmic theory of randomness. See two forthcoming books on algorithmic randomness by 
Downey and Hirschfeldt [2] and by Nies [TT] . We discuss these relations in Section O 

The organization of this paper is as follows. In Section [2] we set up notations for 
the coin-tossing game and give some preliminary results. In Section [3] we consider two 
types of Reality's moves which suggest deviations from independent Bernoulli trials. The 
first is a block type pattern and the second is a Markovian type pattern. We construct 
Skeptic's Bayesian strategies which can exploit these non-randomnesses. We show that 
it is asymptotically always advantageous to exploit higher order patterns in Reality's 
moves. In Section H] we consider asset trading games in continuous time and investigate 
the consequences of high-frequency block type strategies and Markovian strategies. We 
derive the exponential growth rate of Investor's capital process when Market chooses a 
path with variation exponent not equal to two. Finally in Section O we discuss some 
aspects of our results and their implications for related fields. 

2 Preliminaries on coin-tossing games and Bayesian 
strategies 

In this section we summarize preliminary results on coin-tossing games and capital process 
of Bayesian strategies ([12], [Z])- We also discuss a one-to-one correspondence between 
the set of probability distributions on the set of Reality's paths and the set of Skeptic's 
strategies. Finally we note the convexity of the Kullback divergence with respect to its 
first argument. 

In this paper we consider the coin-tossing game in the following form. In the protocol 
the success probability < p < 1 is given. 

CoiN-TossiNG Game 
Protocol: 
/Co := 1. 

FORn= 1,2,...: 

Skeptic announces M„ G M. 
Reality announces x„ G {0, 1}. 

END FOR 

If we write M„ = — M^, then Mn{xn — p) can be rewritten as 

M„(x„ - p) = M^(x„ - p) + M° ((1 - x„) - (1 - p)) . (1) 

In this case we say that Skeptic bets on x„ = 1 and M° on = 0. Although ([1]) is 
a redundant expression, generalizations to multistep protocols in the next section can be 
more transparently understood in this form. A path ^ = X1X2 ... is an infinite sequence of 
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Reality's moves and the sample space H = {,^} = {0, 1}°° is the set of paths. C,"' = xi . . . Xn 
denotes the partial path of Reality's moves up to round n. Throughout this paper we use 
the notation 

Xi ~|- ■ ■ ■ Xjii Xfi . 

n 

Skeptic's strategy P is a set of functions V : ^"^^ ^ Mn which determines Skeptic's 
move Mn at round n based on Reality's moves up to the previous round ^"""^ = xi . . . Xn-i- 
Given a strategy V 

n 

^n(0=/Co + 5^M,(r')(x,-p) 

i=l 

denotes Skeptic's capital process when he uses V, starting with the initial capital of 
/Co = 1- Following the terminology in [17] we call V prudent if /C^(0 ^ for all ^ and n, 
i.e., Skeptic's capital is never negative irrespective of the moves of Reality. We also say 
that Skeptic observes his collateral duty if he uses a prudent strategy. In this paper we 
require that Skeptic's strategies are prudent. 

We consider probability distributions on the set of paths S = {0, 1}°°. In our frame- 
work a probability distribution Q on S is just a collection of consistent discrete proba- 
bility distributions Q = {Qn]n > 1}, where Qn is a discrete probability distribution on 
S„ = {0, 1}" satisfying the consistency condition 

Q4C) = Qn+iiCo) + Qn+iiCi), Vn, vr. (2) 

Note that we do not need measure-theoretic extension of Q to a probability measure on 
a (T-field of S. Under the distribution Q, the conditional probability of = 1 given 
^n-i _ _ _ _ Xn-i with Qn-ii^""^^) > is Written as 

If (5n-i('C"~^) = 0, then is not defined. We call the probability distribution of i.i.d. 
Bernoulh trials with success probability p the risk neutral measure of the coin-tossing 
game. 

Given a probability distribution Q, define a strategy V = Vq by 

Vq : ^ M„ = /C„_i-^|^. (4) 

A motivation of this definition is given in Appendix. If we write Mn = Ml- M° as m 
([I]), then M^, M° are given as 

M^ = /C„_A, M° = /C„_i— (5) 
P 1 -P 

The capital process of Vq is explicitly written as follows ((TJ Theorem 4.1]). 
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This is the hkehhood ratio of Q to the risk neutral measure at the reahzed path up 
to round n. In Appendix, we estabhsh a one-to-one correspondence between the set of 
probabihty distributions on S and the set of prudent strategies. Therefore capital process 
of a prudent strategy can always be expressed as (jH]). 

In particular if we employ a beta-binomial distribution 

T [a + b + n)T [a)T [b) a + b + n — 1 

where a, 6 > are hyperparameters of the prior distribution, then by a simple application 
of Stirling's formula, the resulting capital process denoted by /C° behaves as 

log/C° =nD(x„||p)-0(logn), (8) 

where 

p 1 — p 

D{p\\q) = plog- + (1 -p) log- 

q l-q 

is the Kullback divergence between two scalar probabilities p and q. Hence if Xn deviates 
from p, then D{xn\\p) gives the average exponential growth rate of the capital process. 
We call D{xn\\p) the main growth rate (or simply the growth rate) of the log capital. 

In the subsequent sections we often use the convexity of Kullback divergence for prob- 
ability vectors with respect to its first argument. Let p = {pj}'^^^ and q = {qj}j=i be 
probability vectors and let 

k 

D{p\\q) = 2_]Pilog — 

j=i 

denote the Kullback divergence between p and q. Let pi = {pj^}^^]^, i = 1,2, be proba- 
bility vectors and for < A < 1 let p = Api + (1 — X)p2 = {pj}^=i. Then the following 
relation is easily obtained 

\D{p,\\q) + (1 - \)D{p2\\q) - D{p\\q) = \D{p,\\p) + (1 - \)D{p2\\p) > 0. (9) 

The left-hand side can also be written as 



XD{p4q) + {1- X)Dip2\\q) - D{p\\q) = j^p^ol^ 

■i=^ ^ Pi 



A > 0. 



3 Priors for higher order patterns and multistep strate- 
gies 

As discussed in Section [T]there may be some deviating patterns from independent Bernoulli 
trials in the Reality's moves xiX2 ■ ■ ■ , although the path ^ = xiX2 ■ ■ ■ satisfies SLLN 
lim„^oo Xn = p. The strategy considered in [7] is based only on s„ and it can not exploit 
these patterns. Skeptic can increase his capital by strategies exploiting patterns not re- 
flected in Xn- In the following we investigate two types of such non-randomness or higher 
order patterns. The first is the block type pattern and the second is the Markovian type 
pattern. We give multistep Bayesian strategies which effectively exploit these patterns. 
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3.1 Block patterns 

For clarity of presentation, we first consider sequence of pairs (i.e. blocks of length 2) and 
later generalize the results to blocks of arbitrary length. 

Consider the sequence of pairs {xiX2){xsX4) . . . {x2n-iX2n) of Reahty's moves and de- 
note the number of the pairs (11), (10), (01), (00) among the first n blocks by m}^, m}!^, 
m'^, m°°, respectively. If the sequence is random, i.e., the Reality's moves are i.i.d. 
Bernoulli trials with success probability p, then we will have 

n—>-oo n 

where p^^ = (p)^, p^^ = = p(l — p), p°° = (1 — p)^- We construct a strategy for which 
limsup„ /C„ = oo whenever limsup„ |m^/n — p*-' ! > at least for one (i, j). 

For this purpose, at the (2n — l)-th round (n = 1, 2, . . . ) Skeptic chooses four amounts 
M}^\ , M^^ and bet them on (x2„-iX2„) = (11), (10), (01), (00), respectively. 
Then we have 

/c2„ = /c2n-2+ Yl K'i^ii-p'n, 

i,je{0,l} 



where 



— X2n-1 X X2n 



1, if (a;2n-ia;2n) = (H) 
0, otherwise 



and other z^^ , i,j = 0, 1, are defined similarly. Thus we define a derived capital process 
/C* = )C2n with the protocol 

ic: = ic:_,+ J2 Mii{4-f^^), n = i,2,..., (/c* = i). 

i,ie{o,i} 

As a natural generalization of the beta-binomial distribution treated in [7J, let us take 
the Dirichlet-multinomial distribution as Q(^^"^) = Q{zi. . . Zn). Then the corresponding 
strategy is given by (cf. ^ and ([7])) 

where c*-^'s are positive hyperparameters of the Dirichlet prior and jgjQ c*-' = c. The 
capital process /C* = /C^^ for this strategy is given by 



Q(e'") _ n.,,r(mtf + c^^)/r(c^. 



n.,,(p^^)"^" (r(n + c)/r(c)) w^M'y^ 

r(c) n,,r(m^^- + c-^) 
n„.r(c^Or(n + c) n„-(p^^>"' 
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where in the products i,j range over {0, 1}. 

We evaluate the asymptotic behavior of this capital Denote rrtl/n = p^, i,j = 0, 1. 
Then as in (18]) we have 



log/C: = n J2 P^^log4|-0(logn). 

ijG{0,l} ^ 



Hence for even n the original capital process JCn'^ = /C*^2 is written as 
hglC^^ = log/C:/2 = -D{{p^} II {p^n) - 0(logi 



Now if we neglect the pairwise block patterns and apply the strategy of [7] based on 
Sn only, then corresponding capital process JC^ behaves as ([8]). We compare JCn'^ and /C°. 
By (Q with A = 1/2 we have 

log/C^'^-log/CO = |D((p^\p^o,p°\p°°) II (pi\pf,P°\p°°))-0(logn), (10) 

where pl^ = {xn^ypl^ = = Xn{l — x„),p°° = (1 — XnY- We see that Kn'^ exploits 
the pairwise block type non-randomness more effectively than A^^ by the amount half the 
KuUback divergence given in the right-hand side of (fTOj) . 

So far we have only considered even n. This is sufficient for analyzing the asymptotic 
behavior of JCn'^. For completeness we discuss /Cn'^ for odd n. We can decompose Skeptic's 
bet on z^^ = 1 to the (2n — l)-th round and the 2n-th round as follows. 

1) at the (2n - l)-th round, Skeptic bets pM^^ + (1 - p)M^° on X2n-i = 1 and pM^^ + 

(l-p)MOO on X2n-1 = 0, 

2a) if X2n-i = 1, then at the 2?2-th round he bets M^^ on X2n = 1 and on X2n = 0, 
2b) if X2n-i = 0, then at the 2n-th round he bets on X2n = 1 and on X2n = 0. 

Denote = J2j=o^n — X]j=o'^*"'' = 0, 1. Then the capital at an odd round 

K^n+i is written as follows. 



r^Q -r'^Q ^ \ ("^n^ + ^^^)/ (^(^ - 1 + C)), if X2n+l = 1 



•2n ^ 




f ci+ 
f cO+ 


■2n ^ 


r(c)r( 


mi+ 


P^2n + l(l 


-P) 



= 

3;2n+l) 



pX2n+i(^l _ p)i-3;2n+ir(n + c) F (m^ + c^+)T{ml+ + c°+)' 



We can construct a similar strategy for the sequence of pairs {X2X3) . . . {x2nX2n+i) and 
can also combine these two strategies by splitting the initial capital into two equal parts 
and applying the corresponding strategy for each of them. Let JC^ = KT'^ denote the 
capital process considered so far and let denote the similar capital process based 
on {x2X^){x4^X5) . . . . Then the capital process of the combined strategy is written as 
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K, — (l/2)(/C^ + KP). Hence limsup„/C„ = oo, if the relative frequency of consecutive 
pairs {xin^\Xin)i i^^in'^^n^x) ■, taking the value {ij) does not converge to {p'''/2} for some 

We have discussed blocks of length two for notational simplicity. The above deriva- 
tion can be extended to n blocks of consecutive A:-tuples, and hereafter we outline the 
procedure. Let x\^x\^ . . . ^x\\)% the first n blocks of /c-tuple with 

"^m ~ (^fc(m-l)+l) '^k{m-\)+2-, ■ ■ ■ : Xkm): TU — 1, . . . , n, 

and let m^*' , = ei . . . e^, = 1 or 0, denote the number of the consecutive /c-tuple 
among the first n blocks. Also denote 

If the sequence is random, we will have liran^^mfl' /n = p^* for all G {0, l}'^. 

At the k{n — 1) + 1-st round (n = 1, 2, . . . ), Skeptic chooses 2*^ amounts M^* and bet 
them on a?^ = e^. Then we have 

where 

" 1 0, otherwise. 
Thus the derived game /C* = /Cfc„ is defined with the protocol 

E M^^{z:t-P''),n = 1,2,..., (/CS = 1), 

efce{0,l}'= 

where Ee,e{o,i}fc = 1- 

If we take the Dirichlet-multinomial distribution as = Q{zl'' . . . z^), then the 

corresponding strategy is given by 

where Vc*^* > 0, Yleke{o i}*^ ~ hyperparameters of the Dirichlet prior. The 

capital process for this strategy is 

r(c) Uejimt.'+c'") 



We evaluate the asymptotic behavior of this capital. Denote /n — p^. Then 



\oglCl = nD{{pl-}\\{p^-})-0{\ogn)=n ^ P^'^ log ^ - O(logn). (11) 



Hence in the context of the original game, for n which is a multiple of k, we have 



iogx:^« = iog/c: 



n 



^(R'=}||{p'n)-0(logn). 



■n/k 



k 



Similar strategies can be constructed for the n blocks of the fc-tuples with shift a 



We next combine k kinds of these strategies by splitting the initial capital into k equal 
parts and applying the corresponding strategy for each of them. Let /C„ denote the 
resulting capital process. Then limsup„/C„ = oo, if for any the relative frequency of 
any of x^, x^^-^^, . . . , a;^^^,_i {1 < m < n) taking the value does not converge to 

It is of interest to compare the growth rates of ICn'^ for different block lengths k. 
Note that block pattern strategy of length k also depends on the shift a. If the empirical 
distribution of {sk} is different for a, then the main growth rate of a block strategy depends 
also on a. For simplicity we mainly consider the case that the empirical distribution Fn^a of 
{Xm+a}m=i t^LC Same for different shifts a, in the sense that the total variation distance 
between a and -Fn,a' converges to for all a ^ a' . We call this case "homogeneous 
with respect to the shifts" . We show in Section 13.31 that the main growth rate of ffTTl) is 
non-decreasing in k under the assumption of homogeneity with respect to the shifts. 

3.2 Markovian patterns 

We can construct another strategy which exploits non-randomness in the moves of Reality. 
Such a procedure can be given as 



where M+ and M~ can have different values. This is a first-order Markovian strategy, 
which incorporates the information on the last move of Reality. As in the previous 
subsection, for clarity of presentation we first consider the first-order Markovian strategy 
and then extend it to higher-order Markovian strategies. 

Let q\ = Sn and Qn = ^ ~ ^n- We also denote the numbers of pairs = 



(11), (10), (01), (00), ^ = 2,...,n by g^S g^^ g^i^ ^oo^ respectively. (For n = 1, let 



= ql^ = ql^ = Qi^ = Qi^-) We take the beta- binomial distribution with parameters 
a,b > for Q{xi \ Xi-i = 1) and Q{xi \ Xi-i = 0), i > 2. (The initial distribution is taken 
as Q(l) = p = 1 — Q(0).) Then the corresponding strategy for i > 2 is given by 



m+a 



{-Ck{m-l)+l+a^ -Ck(m-l)+2+a-, • • • ; ^km+a)-, 1 <: TTl <■ Tl, O — 1, . . . , — 1. 




M. 



Pi -P 
P(l-P) 



Pi -P 
P(l-P) 
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with 



Pi 



ql-i + « 



Pi 



qi-i + a 



For 2 = 1, we let = = 0. The capital /C,„ = JCn for this strategy is given by 



C-i + q^'li + a + b 



ICr 



r(a + 6)r(gii + a)r(gio + 6) 



X 



r(a + 6)r(gOi + a)r(gOO + 6) 



r(a)r(6)r(gii + gio + a + 6)p9i' (1 - p)i^^ " r(a)r(6)r(gOi + C + a + 6)pe (1 - p)</Slo 

r(a + bfriql' + a)r(g^° + &)r(g°^ + a)r(g°° + &) 
r(a)2r(6)2r(gii + gi" + a + 6)r(gOi + + « + 6)p'?i^+'??.^ (1 - p)"^ 



We evaluate the asymptotic behavior of this capital process. Write g^/n = Pn = Xn, 
q^ Iqli = '^n' /q^ = ''^n- Then as in (IHl) we have 



log/C„ = np„D(r^||p) + n(l - p„)D(r° 



O(logn). 



(12) 



Hence if either limsup^ — p| > or limsup^ |r° — p| > then limsup^ /C^ = oo. 

Now we compare ( |T2l) to the capital process (E]) of the strategy based on s„ only. By 
counting the number of pairs we have + q^ = q\ — xi and 

Pnvl + (1 - pn)rl = ^ + ^=p„-^=p„- 0(l/n). 

n n n 

Then by ([9]) with A = ]3„ we have 



log JCn - log /C° 



n 



n 



p„D(r^||p) + (1 - Pn)D{r'jp) - D(p„||p)J - O(logn) 
PnDir'JPn) + (1 -p„)D(r°||p„)l - O(logn). (13) 



We again see that log/C^ exploits the first-order Markovian non-randomness more effec- 
tively than log/C° by the amount given above. 

The above first-order procedure can be extended to the k-th order procedure based 
on sequence = a;„_fc • • • of length k preceding x„. We hereafter outline the 

procedure. 

Let g^* denote the number of consecutive fc-tuple in = For [k + 1)- 

tuples efcl = ei . . . efcl and e^O = ei, . . . , efcO we similarly define q^^ and g^*". We take the 
beta-binomial distribution for Q{xi\x^_i = e^), i > k + 1. (The initial distribution Q{C,'^) 
up to round k is taken as the risk neutral measure.) The corresponding strategy is 



Mi = 0, l<i<k, Mi = M^" = /Ci_i 



Pi - P 



if x'y_-^ = efc, i> k + 1, 



with 



Pi 



6fe 



g>i + a 



g^ 'i + g^'i + a + 



a, 6 > 0. 
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The capital process /C„ = ICn'^ for this strategy is written as 

r(a + 6)r(g^'^i + a)r(g^^o + 6) 



efce{0,l} 



■ , r(a)r(6)r(g^^ + a + h)p'i-' (1 - p)^" 



T{arV{hr J^l^, r(g- + a + h)/^^! - p)''^^" ' 

We evaluate the asymptotic behavior of this capital. Denote p^^'*^'' = /n, = /q^ . 
Note that p^^'^'^ differs from in (ITTl) . because the latter only looks at the relative 
frequency of ek among non-overlapping blocks of length k. As n — cx), we have 

log;C„ = n Vn'''D{Tl-\\p)-0{\ogn). (14) 

Hence if limsup„ \r^'' — p| > for any e^, then limsup„/Cn = oo. In the next section we 
show that the first term on the right-hand side of ( fT4l) is non-decreasing in k. 



3.3 Relations between block strategies and Markovian strate- 
gies 

In ffTOl) and f|T3|) we saw that the block strategy of length two and the first-order Markovian 
strategy have better main growth rates than the strategy based s„ only. In this section 
we give results on the comparison of main growth rates for general k, which is the block 
size for block strategies and the order for Markovian strategies. 

For Markovian strategies we show that a larger k gives a better growth rate. Con- 
cerning block strategies we show that the same result holds under the assumption of 
homogeneity with respect to shifts and furthermore that Markovian strategy of order 
k — 1 gives a better growth rate than the block strategy of length k. 

We first consider Markovian strategies. Let ICff'^ denote the capital process of fc-th 
order Markovian strategy. In ^\M . for a given Ck-i = €2 ■ ■ - ek consider the sum of two 
terms involving Isk-i = le2 . . . Cfc and Oek-i = 0e2 . . . e^. We have 

where 0{l/n) is due to the counting problem at the end of the sequence Also 

Then by 

= pf ^^-iD(ri^-^||<-0 +Pf + 0(l/n). (15) 
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Summing up over Sk-i G {0, 1}^ ^ we have 
log/Cf'^--log/Cf'=-i 

= n Yl +Pf °^'="^^(r°^'=-ir^'=-)] - O(logn). 

(16) 

Therefore the growth rate of the Markovian strategy of order k is larger than that of order 
/c — 1 by the amount shown on the right-hand side. 

Next we consider block strategies. For j = 0, 1, define 




Then 

6fee{o,i}'= ^ e,g{o,i}fc-i j=o 1^ e^e{o,i}fe-i ^ 

Now pn^'''^ = rtt'^ and under the the assumption of homogeneity with respect to shifts, 
the relative frequency of is the same for different shifts a and this implies that 
p^n~^ = pn '^'"^ + oil). Therefore 

D{{P'r^} II {p^^}) = Pn''''^'D{rr'\\p)+Dmr} II {p^-^}) +o(l). 

By induction on k we obtain 

k-i 

fclogC' = log/Cf'"' + (A: - 1) log/Cf ''^-^ + o(n) = log/Cf ^ + o{n), 

i=Q 

or 

^ fc-i 

^og}C^'' = ^Y.^og)C^'^ + o{n), (17) 

i=0 

where /C*^'° = /C° is the capital process of the strategy based on s„ only. In f|T6|) we saw 
that the growth rate of Markovian strategy is non-decreasing in A;. It follows that under 
the assumption of homogeneity with respect to shifts the growth rate of the block strategy 
is also non-decreasing in k and furthermore the growth rate of the Markovian strategy of 
order — 1 is better than that of the block strategy of length k. 

We should note that if the homogeneity does not hold, then the growth rate of the 
block strategy of length k might be better than that of the Markovian strategy of order 
k — 1. If we divide the initial capital into k equal parts corresponding to each shift, then 
the combined capital process is the arithmetic average of the capital process for different 
shifts. The growth rate of the combined capital equals the maximum of capital processes 
for different shifts and the maximum might be better than that of the Markovian strategy 
or order k — 1. The question of homogeneity comes up again in consideration of the asset 
trading game in continuous time. 
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3.4 Universal Bayesian Skeptic by mixture of priors 

When we incorporate the strategies developed in the previous subsections, we get a strat- 
egy which can exploit any block or Markovian patterns of any length deviating from 
independent Bernoulli trials. 

Let V^'^ be the Bayesian strategy which exploits the /c-th order block patterns con- 
structed in Section 13.11 and let "P^'^ be the Bayesian strategy which exploits the fc-th 
order Markovian patterns constructed in Section 13. 2[ At first we divide the initial capital 
/Co = 1 into equal two parts /Cqb = 1/2 and /Cqa/ = 1/2, and further divide /Cqb = 1/2 
into countably infinite accounts with positive initial capitals cbi, cb2, • • • , YlkLi ^Bk = 1/2, 
and also divide /Cqm = 1/2 similarly as ca/i, cm25 • • • , Yl'k=i^^ik = 1/2. We apply the 
strategy V^'^ to the k-th. account cbu and apply the strategy V^^'^ to the k-th. account 
CMk, respectively. The resulting "universal" strategy 

oo oo 
k=l k=l 

can exploit any block or Markovian pattern of any length. 

We shall formulate and state this fact within the framework of measure-theoretic 
probability in order to clarify the connection to the universal source coding in information 
theory. For simplicity of statement we consider the coin-tossing game with p = 1/2 and 
use the base two logarithm. Let {p^''} denote the fc-dimensional probability distribution 
of a random vector (Xi, . . . , X^) G {0, 1}^. Then 

^D({p^nii{p^n)= E p''''^og,^^ = k-H{x„...,x,), 

where H{Xi, . . . , X^) denotes the entropy of {p^''}. For an infinite sequence Xi, X2, . . . 
of stationary and ergodic 0-1 random variables, the entropy H{X) = H{Xi, X2, . . .) 
is defined as H{X) = \imk{l/k)H{Xi, . . . , Hk). Under the assumption of stationarity 
and ergodicity, each fc-dimensional empirical distribution converges to the probability 
distribution {p^''} almost surely. Furthermore in the previous subsection we saw that 
larger block sizes achieve better growth rates. Therefore, arguing as in Chapter 13 of |Ij 
we have the following proposition. 

Proposition 3.1. //Xi,X2, . . . , are stationary and ergodic sequence of 0-1 random vari- 
ables, then 

— \0g2JCJ! ^1—H(X), a.s. (n^oo). 
n 

Thus we can say that /C^* 00 achieves the optimal rate in the sense of universal 
source coding in information theory. 
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4 Application to asset trading games in continuous 
time 



In this section we apply the results on the block strategy of length two and the first-order 
Markovian strategy of the previous section to an asset trading game in continuous time 
considered in [T6]. It should be noted that our interest here is to derive explicit growth 
rates of our strategies applied to the asset trading game, rather than a rigorous treatment 
of forcing of the variation exponent of two. Therefore in our derivation we proceed with 
informal definitions and convenient regularity conditions. 

In Section 14.11 we summarize the setup of an asset trading game. In Section 14.21 
we obtain growth rates of our strategies for the asset trading game when the variation 
exponent of the asset price path deviates from two. 



4.1 Preliminaries on asset trading games in continuous time 

Here we summarize preliminary facts on the asset trading game as formulated in ^L6^. Our 
framework in [16] is now much generalized in the recent papers of Vovk ( [17] , [18] , [19] ) . 

Suppose that there is a financial asset which is traded in continuous time. Let S{t) 
denote the price of the unit amount of the asset at time t. We assume that S{t) is positive 
and a continuous function of t. The price path S{-) is chosen by a player "Market", 
which is the same as Reality in the coin-tossing game. "Investor" enters the market at 
time t = to = with the initial capital of /C(0) = 1. He decides discrete time points 
= to < ^1 < ^2 < ■ ■ ■ to trade the financial asset. The trading time tj and the amount 
Mj of the asset Investor holds for the interval can depend on the path of S{t) up 

to time tj. 

The basic fact on the behavior of S{t) is the "-\/di-effect" ([20], [IS]), which asserts that 
infinitesimal increments |dS'(t)| of the price path have to be of the order 0{Vdt), in the 
sense that otherwise Investor can make arbitrarily large profit without risking bankruptcy. 
When |dS'(t)| = 0{{dt)^), then H G (0, 1] is called the Holder exponent or the Hurst index 
of S{t), and 1/H is called the variation exponent. Thus the game-theoretic statement of 
v^-effect is that Investor can force the variation exponent of two. 

We consider "limit order" strategy of Investor. Let 5 > be a constant. Investor 
determines the trading times ti,t2, ■ ■ ■ as follows. After U is determined, let tj+i be the 
first time after ti when either 

— 1 + or — ' ^ ° ^ 



S{ti) 1 + 5 

happens. This procedure leads to a discrete time coin-tossing game embedded into the 
asset trading game as follows. Let 

_ {l + 5)S{tn+l)-S{tn) _ [l, ifS{tn+l)=S{tn){l+S) 

6{2 + 6)S{t„) lO, if^(Wi)=S(t„)/(l + 5), 
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and 



Also write /C„ = K,{tn+i)- Clearly be thought as "heads" or "tails" chosen by 

Market after the time tj. More formally we define the following protocol of an embedded 
discrete time coin-tossing game. 

Embedded Discrete Time Coin-Tossing Game 
Protocol: 
^0 := 1. 

FORn= 1,2,...: 

Investor announces Vn G 1^- 
Market announces x„ G {0, 1}. 

END FOR 

This embedded discrete time game allows us to apply results on coin-tossing games 
to the asset trading game in continuous time. The amount Mj of the asset held during 
[ti, tj+i) is determined by our strategies in Section [31 From now on we fix the time interval 
[0,T] of the asset trading game. Then the total number rounds n = n{6) played in the 
embedded coin-tossing game is finite. For a given path, n{6) is increased by letting 5 | 
in ( ITSl) . n{6) diverges to infinity as 5 | 0, unless S(t) is constant on [0,T]. We call 
a strategy with small S a high-frequency strategy. In [16] we applied high-frequency 
Bayesian strategy of [7] to the embedded discrete time game and proved that Investor can 
force the variation exponent of two, within an arbitrary small constant. 

However the Bayesian strategy of [7] is based only on s„ and does not take the higher 
order patterns of the increments of S{t) into account. In the previous section we saw 
that multistep Bayesian strategies can effectively exploit higher order patterns when Re- 
ality's moves are not random. Therefore it is of interest to investigate how fast Investor 
can increase his capital by a high-frequency multistep Bayesian strategy, if the variation 
exponent of S{t) deviates from two. 

4.2 Growth rates of block strategy of length two and first-order 
Markov strategy for asset trading game 

In this section we derive growth rates of high-frequency block strategy of length two and 
first-order Markov strategy in the embedded coin-tossing game. Our results are stated in 
two propositions at the end of this section. 

Write r] = log(l + 6). Then | is equivalent to 5 ]. 0. We decrease rj to zero as 

Vk = 2-\ fc = l,2,.... 

The advantage of taking this sequence of rjk is that the equi-spaced grids for log S{t) 
are completely nested in k and we can establish some important relations between the 
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empirical distributions of block patterns for different k (see Lemma [4.11 below). We call 
the embedded coin-tossing game with r/^ the k-th embedded coin-tossing game. Let n{7]k) 
denote the total number of rounds of the k-th embedded coin-tossing game. For notational 
convenience we sometimes write n or Uk instead of n{r]k)- As in Section [XTl let '^^j./2' 
i,j = 0, 1, denote the number of pairs (ij) among {xiX2){x3X4) ... in the k-th embedded 
coin-tossing game. For the shift of one, let denote the number of pairs [ij) among 

(x2X3)(x4X5) ... in the k-th embedded coin-tossing game. For these counts of pairs, it 
is more precise to write m^^^^^] "^[(nfc-i)/2]- However for simplicity we write m^n^./2 '^^ 
m^^^2 ill the following. We define qn^,ql^^ as in Section [X^ 

We now give a preliminary consideration on the behavior of counts n{r]k), ql^^, ql^^, 
different k. At this point it is helpful to consider properties of fractional 
Brownian motion (Chapter 4 of [3j). Let {Bnit)} denote the fractional Brownian motion 
of Hurst index H. Buif) corresponds to logS'(t) in the asset trading game. Buit) is 
a typical stochastic process with \dBH{t)\ = 0{{dt)^). {BH^t)} is self-similar, i.e., for 
every a > the distribution of {Bniat)} coincides with that of {a^Bnit)}. This implies 
that making the grid finer a.s rjk ^ i]k/2 is equivalent (in distribution) to increasing T as 
T This suggests that when Market chooses a path S{t) with a fixed exponent 

if, then 

n,+i ~ 2V^n,. (19) 

Furthermore {BH{t)} has stationary increments, i.e., the distribution of the increments 
of {Bnit)} are invariant with respect to arbitrary time shift. This corresponds to our as- 
sumption of homogeneity with respect to the shifts in Section [3^ Under the homogeneity 
assumption we expect 

2m'^ ,„ ~ 2m'^' ,„ ~ q^J . (20) 
Also note the following trivial combinatorial relations for any n: 

Q^n + (fn = 9n - + = ql " Xn and |g°^ - I = k„ - Xi | < 1. 

From game-theoretic viewpoint Investor can force H = 1/2. If Market chooses a path 
with H 7^ 1/2, then the notion of forcing can not be applied and there is no guarantee that 
f fT9|) and fl20l) hold. However even for if 7^ 1/2 we use f lT9|) . fl20l) as convenient regularity 
conditions for evaluating the growth rates of our strategies in view of the properties of 
the fractional Brownian motion. In Proposition 14.11 and Proposition 14.21 below we obtain 
the growth rate of the first-order Markovian strategy and block strategy of length two 
under these conditions. In these propositions the approximate equalities in (fT9l) and (1201) 
are understood in the sense that the ratios two sides converge to 1. 

Now we state the following crucial combinatorial fact. 

Lemma 4.1. For each path S{t) and for each k 

™ii _ „i ™00 _ 
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Proof. Consider two nested equi-spaced grids with intervals rjk-i and rjk = rik-i/2. For the 
grid with the interval r/fc-i, consider a step, where the price is going upward from ti to tj+i 
i.e. log5'(ti+i) = log S{ti) + Tjk-i in f|T8|) . It is obvious that this upward step corresponds 
exactly to two consecutive upward steps for the the grid with the interval r]^. Therefore 
~ ^Wfe-i' counting downward steps, we similarly obtain ?^°°/2 ~ ^n^-i- '-' 

We now derive the growth rate of the first-order Markovian strategy. Define 

TV{rik,T) = I log^(ti) - logS(ti_i)| = UkTjk = {ql^ + g°Jr/fc, 
1=1 

L(r^,,T) = log5(t.J - log 5(0) = {ql^ - g°Jr/,. 

Under (fT9l) . nk+irjk+i — 2^/^~^nk'r]k- Therefore Uk+iVk+i — > c)o as A; — > oo for the case 
H < 1. On the other hand S(tnJ S(T) as — cx) and L{rik, T) — >• log S(T) — log S{0). 
Therefore for each path, L{r]k,T)/TV{rijs,T) — as A; — >■ cx) and this implies that 



(A;-^oo). (21) 



Also note that 1/2 = lim^^o Ps- 

Furthermore by Lemma [4.11 under f|T9|) 

Therefore 

1 9nfc '^fc-l 1 



Similarly = q^^/q^^. — 1 — 1/2^/-^"^. Let /C*^ denote the capital of the first-order 
Markovian strategy at the end of the fc-th embedded coin-tossing game. By f|T2|) we 
obtain the following proposition. 

Proposition 4.1. Suppose that Market chooses a path such that 

1- "^k+i 2m^^ /2 
1 = lim ^ = lim = hm J = 0, 1. 



Then 



We now consider the block strategy of length two. Let K.^^ denote the capital of block 
strategy at the end of the k-th embedded coin-tossing game. By ( |T71) we know that log/C^^ 
is the average of log/C*^'^ and log/C^^'°. However by (12T!) the growth rate of /C^'° is zero. 
Therefore we have the following result. 
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Proposition 4.2. Under the same assumption as in Proposition \4- 

1 



lim — log/Cf = -d(—^ -\ 



Therefore the growth rate of the block strategy is half of rate of the Markovian strategy. 
/C^'" is the capital process of the strategy based only on the past average of Reality's moves 
considered in 



whose growth rate is zero. It is of interest to note that despite this 
zero growth rate, the strategy in p!6] was sufficient to force the variation exponent of two 
of the Market's path. This suggests that looking for a simple strategy for forcing certain 
event and looking for a more aggressive strategy with a better growth rate need different 
considerations. 



5 Discussions 

In this paper we studied multistep Bayesian strategies for coin-tossing games. Our general 
conclusion was that asymptotically we obtain better growth rates by incorporating larger 
block sizes for block strategies or longer orders for Markovian strategies. However this 
conclusion has to be taken with the following cautions. When the main growth term 
expressed in terms of the Kullback divergence is close to zero, we have to compare this 
to the term of order 0(logr;,). Generally the term of order O(logn) can be understood 
as a penalty term for larger models, i.e., for using strategies incorporating larger blocks. 
Therefore if the coin-tossing game is played only a finite number of rounds, or Reality does 
not deviate too much from the independent Bernoulli trials, then it might be advantageous 
to use shorter block sizes. This is essentially the same tradeoff as in statistical model 
selection based on various information criteria. It is of great interest to consider selecting 
among strategies or dynamically adjusting weights for them. 

For convenience we made the assumption of homogeneity for block strategies in Section 
13.31 and in Section HI We initially thought that homogeneity can be "forced" on Reality 
by appropriate strategies of Skeptic. However, when Reality deviates from independent 
Bernoulli trials, the game-theoretic notion of forcing can not be applied. Intuitively it 
seems that Skeptic can further exploit patterns in Reality's moves when the homogeneity 
with respect to shifts does not hold. However at present it seems difficult to formulate 
results in this direction. 

In Section 13.41 we considered an infinite countable mixture of block strategies and 
Markovian strategies. Using this countable mixture. Skeptic can asymptotically exploit 
any deviation of Reality's moves from independent Bernoulli trials. We pointed out that 
the idea of the universal source coding in information theory is similar. Our result is 
also very closely connected to results in algorithmic theory of randomness. We can think 
of each component strategy as a test of randomness of Reality's moves. In algorithmic 
randomness there are strong computability restrictions on the allowed sample spaces. 
In the game-theoretic approach we do not have to worry about computability and by 
appropriate discretization it is now possible to discuss the randomness of continuous 
paths. 
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In Section |1]2] we only considered block strategies of length two and first-order Marko- 
vian strategies in the embedded coin-tossing game. We could obtain the explicit descrip- 
tions for the growth rates in Proposition 14.11 and Proposition 14.21 because of the combi- 
natorial fact of Lemma 14.11 It is of interest to investigate growth rates of higher-order 
Markovian strategies in the asset trading game. 

For measure-theoretic stochastic processes, the regularity conditions assumed in Propo- 
sitions 14.11 and 14.21 are basically law of large numbers, and we expect that they hold for 
fractional Brownian motions. However the trading times in (fT8l) are stopping times and 
the fractional Brownian motion for if 7^ 1/2 is not a Markov process. Therefore it is not 
easy to prove that the regularity conditions hold for fractional Brownian motions. 

A Equivalence of Bayesian strategy and prudent strat- 
egy in coin-tossing games 

Here we establish a one-to-one correspondence between Skeptic's prudent strategy and a 
probability distribution on the set of paths S in the coin-tossing game. 

For one direction suppose that Skeptic models Reality's moves by a probability distri- 
bution Q. Write a„ = Mn/JCn-i- Given JCn-i assume that Skeptic tries to maximize the 
conditional expected value of log/C„. It is equivalent to maximizing 



with respect to a„, where p„ = is given in ([3]). The maximizing value of q;„ is uniquely 
given as 



With this a. 



Note that /C„ = if either p„ = and a;„ = 1 or p„ = 1 and x„ = 0. In this case Skeptic 
can not play any more. For other cases he can keep playing the game. It should be noted 
that this is consistent with the definition of conditional probability in ([3]), namely. Skeptic 
can continue the game if and only if ([3]) is defined. We have shown that a probability 
distribution Q leads to the strategy given in (jl]). 

For another direction let V he a. prudent strategy of Skeptic. Starting with the initial 
capital of /Co = 1, define 



Pn log(l + a„(l - p)) + (1 - p„) log(l - a„p) 



(22) 




Qi(l) = p + Mip(l - p) = p(l + Mi(l - p)), 
Qi(0) = 1 - p - Mip(l - p) = (1 - p)(l - Mip). 
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Then Qi(0) and Qi{l) are non-negative and 1 = Qi{0)+Qi{l). For the case }Cn-i{^"' ^) > 
recursively define 

Qn(r-^O) = (1 - p)Qn-l{C-') (l - • 

These are non-negative and satisfy the consistency condition ([2]). If /C„_i(^"~^) = 0, then 
define = = <5n(^"~^0), which is also consistent. By this procedure a Skeptic's 

prudent strategy leads to a probability distribution V ^ Q. 

By construction it is obvious that this map is the inverse map to (jl]) and therefore 
there exists a one-to-one correspondence between the set of probability distributions and 
the set of Skeptic's strategies satisfying the collateral duty. 

Finally we state the following Bayesian optimality result, which follows easily from 
the maximization in ( 122|) 

Proposition A.l. Let Q be a probability distribution on S and let V be the strategy 
corresponding to Q. For any other strategy V 

E'^{\ogKZ^)>E^{}oglct)- 
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